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Abstract: Clustering logs have been the subject of much study in recent literature. They 
■ are a class of large logs which arise for non-global jet-shape observables where final-state 

particles are clustered by a non-cone-like jet algorithm. Their resummation to all orders 
is highly non-trivial due to the non-trivial role of clustering amongst soft gluons which 
\ results in the phase-space being non-factorisable. This may therefore significantly impact 

the accuracy of analytical estimations of many of such observables. Nonetheless, in this 
paper we address this very issue for jet shapes defined using the kt and C/A algorithms, 
^ ■ taking the jet mass as our explicit example. We calculate the coefficients of the Abelian 

a 2 L 2 , a^L 3 and aj.L NLL terms in the exponent of the resummed distribution and show 
that the impact of these logs is small which gives confidence on the perturbative estimate 
without the neglected higher-order terms. Furthermore we numerically resum the non- 
global logs of the jet mass distribution in the kt algorithm in the large- N c limit. 
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1. Introduction 

Event and jet shapes have played a significant role in the development and testing of QCD 
and its parameters (see ref. [1] for a review), ranging from the confirmation of vector-boson 
nature of gluons [2] to very accurate measurements of the coupling and colour factors of 
its underlying theory [3,4], as well as tuning Monte Carlos [5-7,9]. They have also been 
used to extract moments of the non-perturbative coupling, as reviewed in refs. [1] and [4] 
(and references therein). 

Despite the fact that the LEP collider was a clean environment to "measure" QCD 
using event/jet shapes, they will still be an important tool at the LHC, particularly jet 
shapes and jet substructure which are expected to be invaluable tools in the search for new 
physics due to the final state being dominantly jets [7,9]. For instance there has recently 
been much interest in the concept of substructure of jets, used to identify massive boosted 
electroweak objects whose hadronic decay products tend to cluster into the same fat jet, 
e.g, [8-13]. The invariant jet mass, being central to jet shapes and phenomenologically 
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most useful, provides a simple way of signal/background discrimination. Whilst the jet- 
mass distribution of QCD jets is, away from the Sudakov peak (at low values), featureless, 
that of heavy-particles decay products has "bumps" [12]. 

Given their unequivocal importance at the LHC, event/jet-shape observables have re- 
ceived substantial progress over the last few years, both at fixed order and to all orders 
in the perturbative expansion. For "global" observables, Next-to-Leading Log (NLL) re- 
summation, NLO fixed-order calculations and their matching have been available for a 
number of event/jet shapes for quite a while [14, 15]. In fact, calculations of arbitrary such 
observables can now be performed automatically using programs such as CAESAR [16]. The 
state-of-the-art perturbative calculations is up to NNLO for fixed order [17] and up to 
N 3 LL accuracy 1 for resummation [14,15]. Moreover, non-perturbative calculations have 
seen noticeable advancements, especially in disentangling various components of these ef- 
fects such as the underlying event and hadronisation [18]. 

However there is a class of observables termed "non-global" [19, 20] which suffer from 
large logs, which are absent for global observables and which have not been fully resummed 
even at NLL. The invariant jet mass, with or without a cut on inter-jet energy flow, is a 
typical example. Resummation of non-global logs, valid only in the large- N c approximation, 
has long been available for a wide range of observables which are linear in soft momenta 
and progress has been made for jet-defined quantities which are not linear in soft emissions, 
e.g. gaps-between-jets energy flow [21,22] and jet mass with a jet veto [23]. Furthermore 
it was found in [21,22] and [23] that applying a clustering algorithm on the final-state 
partons seems to significantly restrict the phase-space responsible for non-global logs, thus 
producing a sizeable reduction in their phenomenological impact. 

In the presence of a jet algorithm, other than anti-/ct [24], non-global observables suffer 
from large logs in the Abelian part of the emission amplitude. These are referred to as 
"clustering logs" [23,25,26]. The clustering logs were first computed at fixed order in 
ref. [27] and subsequently partially resummed in ref. [22] for away-from-jets energy flow. 
There, however, the resulting exponent of the resummed distribution has been written as 
a power-series in the radius parameter of the jet algorithm (R) starting from R 3 . Thus for 
a typical jet radius (R ~ 1) it was sufficient, for an accurate approximation, to compute 
the first couple of terms (0(R 3 ) and 0(R 5 )), as the series rapidly converge. Excellent 
agreement was noticed when compared to the output of the Monte Carlo (MC) developed 
in [19]. 

Clustering logs arise due to mis-cancellation between real emissions and virtual cor- 
rections. This mis-cancellation results from re-clustering of final-state configurations of 
soft gluons. Unlike the single-log Ef distribution, resummation of clustering logs in the jet 
mass distribution cannot simply be written as a power-series in the jet radius. Collinear 
singularities at the boundary of a small- i? jet yields large logs in the radius parameter, 
which appear to all orders in a s [23,28]. Note that the jet veto distribution, studied in 
the latter references, disentangles from the jet mass distribution to all orders [29] and has 
a non-global structure analogous to the E t distribution. That is, the coefficients of both 

1 We speak of the logarithmic accuracy in the exponent of the distribution. 
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non-global and clustering logs are identical for the jet veto and Et distributions. The 
arguments of the logs are different though [23]. 

The clustering logs have recently been the subject of much study both at fixed order 
and to all orders. However there has been no full resummation to all orders and it was 
recently suggested that it is unlikely that such logs be fully resummed even to their leading 
log level, which means NLL accuracy relative to leading double logs [25,26]. In this paper 
we address this very issue for the jet mass distribution. 

In ref. [28] the resummation of the jet mass distribution for e + e~ annihilation, in the 
anti-fct algorithm [24], was performed to all orders including the non-global component 
in large-iV c limit. Employing the anti-/^ clustering algorithm meant that the observable 
definition was linear in transverse momenta of soft emissions and therefore the resummation 
involved no clustering logs. In the same paper the authors also carried out a fixed-order 
calculation for the jet mass distribution at 0(a 2 ) employing the kt algorithm [30,31] in 
the small- R limit. The result of integration yielded an NLL term, a 2 L 2 , that is formally 
as important as Sudakov primary NLL terms. 

In the framework of soft-collinear effective theory [32] the authors of ref. [25] confirmed 
the findings of ref. [28] and computed the full -R-dependence of the jet mass distribution at 
0{oq). They also pointed out the unlikelihood of the resummation of the clustering logs 
to all orders. 

We present here an expression for the all-orders resummed result of clustering logs to 
NLL accuracy in the kt and C/A algorithms [30,33] for the jet mass distribution. The 
logs that we control take the form TnOt^U 1 (n > 2) in the exponent of the resummed 
distribution, and we only compute J~2, J 7 ^ and J-4 in this paper. By comparing our findings 
to the output of the Monte Carlo of ref. [19] we show that missing higher-order terms are 
negligible. We also show that the impact of the primary-emission single clustering logs 
is of maximum order 5% for typical jet radii. Furthermore we estimate the non- global 
(clustering-induced) contribution to the jet mass distribution in the large- N c limit using 
the Monte Carlo of ref. [19] in the case of the kt algorithm. 

This paper is organised as follows. In the next section we define the jet mass and 
show how different algorithms affect its distribution at NLL level. We then start with the 
impact of kt and C/A algorithms on the jet mass distribution at 0(a 2 ), thus confirming the 
findings of ref. [28] . In section [| we unearth higher order clustering terms and notice that 
they exhibit a pattern of exponentiation. By making an anstaz of higher-order clustering 
coefficients we perform a resummation of the clustering logs to all orders in sec. |5| and 
compare our findings to the output of a Monte Carlo program in sec. ^. We also perform a 
numerical estimate of the non-global logs in the large- N c limit for the jet mass distribution 
in the kt algorithm in sec. 6.1. Finally we draw our conclusions and point to future work. 



2. The jet mass distribution and clustering algorithms 

Consider for simplicity the e + e~ annihilation into two jets produced back-to-back with 
high transverse momenta. We would like to study the single inclusive jet mass distribution 
via measuring the invariant mass of one of the final-state jets, M?, while leaving the other 
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jet unmeasured. One can restrict inter-jet activity by imposing a cut Qq on emissions in 
this region [34,35]. For the purpose of this paper we do not worry about this issue because 
the effect of this cut has been dealt with in the literature [22, 23, 28] and can be included 
straightforwardly. 

The normalised invariant jet-mass-squared fraction, p, is defined by: 

iej J V i / 

where the sum in the numerator runs over all particles in the measured jet, defined using 
an infrared and collinear (IRC)-safe algorithm such as kt or C/A algorithm [30,33]. At 
born level the jet mass has the value zero and it departs from this value at higher orders. 

In general sequential recombination algorithms [24,30,33,36] one defines distance mea- 
sures for each pair of objects 2 {i,j} in the final state as [36]: 

dij = 2 min (e* p , £f ) (1 - cos , (2.2) 

and for each parton i a distance, from the beam B, 

d lB = E 2p R 2 . (2.3) 

The values p = —1,0, 1 correspond respectively to the anti-kt, C/A and kt algorithms. A 
typical algorithm takes the smallest value of these distances and merges particles i and 
j into a single object when is the smallest, using an appropriate combination scheme 
such as addition of four-momenta. If instead a distance diB is the smallest then object i is 
considered a jet and is removed from the list of final-state objects. This procedure is then 
iterated until all objects have been clustered into jets. We may write 2(1 — cos 0y) ~ Ofj in 
the small-angles limit where jets are narrow and well-separated to avoid correlations, and 
hence contamination, between various jets. 

In the kt algorithm, softest partons are clustered first according to the above mentioned 
procedure, while in the anti-kt algorithm clustering starts with the hardest partons. In the 
C/A algorithm only angular separations between partons matters so clustering starts with 
the geometrically closest partons. In effect, clustering induces modifications to the mass of 
the measured jet due to reshuffling of soft gluons. Only those gluons which end up in the 
jet region would contribute to its mass. Different jet algorithms would then give different 
values of the jet mass for the same event. 

The global part of the integrated resummed jet mass distribution in the kt (C/A) 
algorithm is related to that in the anti-kt algorithm by: 

' f? 2 \ / p2 s 



P J VP 



exp 



k t (C/A) i R 2 



(2.4) 



P 

where £ antl-kt resums the leading double logs (DL) (due to soft and collinear poles of the 
emission amplitude) as well as next-to-leading single logs (SL) in the anti-fc( algorithm. 



2 Objects refer to actual tracks, cells and towers in the detector and to particles/partons in perturbative 
calculations. 
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The reader is referred to ref. [28] for further details about this piece. The function c/2, A 
contains the new large clustering single logs due to A^(C/A) clustering. 

In addition to the global part, each of the distributions £ antl-kt ' fct ' C / A receives its own 
non-global NLL contribution factor S(R 2 /p). In the anti-fcj algorithm the resummation 
of the non- global logs in the large- N c limit for the jet mass distribution was estimated in 
ref. [28]. The result was actually shown to coincide with that of the hemisphere jet mass 
case. The latter has been available for quite a while [19,20]. For kt clustering the effect 
of non-global logs has been dealt with in the literature, for example, in the case of gaps- 
between-jets E t flow [21,22]. We expect the gross features of the latter to hold for the jet 
mass observable. Essentially, the impact of kt clustering is in such a way as to reduce the 
size of the anti-A^ (non-clustering) non-global logs. Brief comments on such a reduction 
will be given in sec. |6.1| . In this paper we are, however, mainly interested in the function 

92,a( r2 /p)- 

3. Two-gluon emission calculation 

The effect of k t and C/A clustering logs starts at 0(a 2 s ). At 0(a s ) there is indeed a 
dependence on the jet radius but this dependence is also present for the anti-fcj algorithm 
and has been dealt with in ref. [28]. In this regard we begin with two-gluon emission case 
and study the effect of clustering using both the C/A and kt clustering algorithms. This 
calculation has already been performed in ref. [28] for the kt algorithm in the small- R limit. 
Here we perform a full-i? calculation of the logs coefficient. 

3.1 Calculation in the k t algorithm 

Consider the independent emission of two soft energy-ordered gluons ki <C k\ <C Q, where 
Q is the hard scale. This regime, i.e. strong energy ordering, is sufficient to extract the 
leading clustering logs. 

In the case of the anti-A^ algorithm discussed in ref. [28] the only contribution from 
these gluons to the jet mass differential distribution is when both of them are emitted 
within an angle R from the hard parton initiating the measured jet. This is because the 
anti-fcj algorithm works in the opposite sense of the kt algorithm: clustering starts with the 
hardest particles. In this sense the algorithm essentially works as a perfect cone around 
the hard initiating parton with no dragging- in or dragging-out effect. 

In the kt algorithm, on the other hand, the two gluons may be in one of the following 
four configurations: 

1. Both gluons k\ and ki are initially, i.e. before applying the algorithm, inside 3 the 
triggered jet. This configuration contributes to the jet shape (mass) regardless of 
clustering. The corresponding contribution to the jet mass distribution is identical 
to that of the anti-£^ case (accounted for by £ antl ~ kt - see eq. ( |3.2| ) below). 

3 We use inside, or simply "in" , to signify that the parton is within an angular separation of R from the 
hard triggered parton (jet axis), and outside, or simply "out", if it is more than R away from it. 
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2. Both gluons are initially outside the triggered jet. This arrangement does not con- 
tribute to the jet shape regardless of whether the two gluons are clustered or not. 

3. The harder gluon, ki, is initially inside the triggered jet and the softer gluon, k 2 , is 
outside of it. The value of the jet shape (mass) is not changed even if clustering takes 
place. This is due to the strong-ordering condition stated above. 

4. The harder gluon, fei, is initially outside the triggered jet and the softer gluon, k 2 , 
is inside of it. Applying the algorithm one finds that a real- virtual mis-cancellation 
occurs only if k 2 is pulled-out of the triggered-jet vicinity by k%, a situation which is 
only possible if k 2 is "closer" (in terms of distances dij ) to k\ than to the axis of the 
jet. 

We translate the latter configuration mathematically into the step function: 

~ 2 (k u k 2 ) = e(0? - r 2 )&(r 2 - elmel - 0? 2 ), (3.i) 

where Oi is the angle between gluon ki and the jet axis and 0\ 2 is the relative angle of 
the two gluons. The above condition is valid only in the small- R limit and we extend this 
to the full i?-dependence in appendix [A]. Hence E2 = 1 when configuration |I| above is 
satisfied and H2 = otherwise. While in the case where the gluon k\ is virtual and k 2 
is real (meaning that k\ cannot pull k 2 out) the particle k 2 contributes to the jet mass. 
To the contrary, when both k\ and k 2 are real, then k\ will not allow k 2 to contribute to 
the jet mass as it pulls it out. Thus a real- virtual mismatch occurs and a tower of large 
logarithms appears. To calculate these large logarithms we insert the clustering condition 
E 2 above into the phase-space of the 0(a 2 s ) integrated jet mass distribution. The latter, 
normalised to the Born cross-section o"o is given, in the soft and collinear approximation, 
by: 

Y% (R 2 / p ) = S anti-k t ^ R2/p j + s clus ^ R 2 j ^ ^ 

Y>t s {R 2 /p) = ^(-^v)V ^Hafe.fe), (3.2) 

with 

dcoi d cos 9i d(f>i f 4ui \ dxi dfa d9f 2 

dPi = r^- G — 1 - cos#i - p ~ — 2 e{xi9i - p), (3.3) 

where Qi, 4>i and Xi = 2uji/Q are the polar angles, with respect to the jet axis, and the 
energy fraction of the i th gluon. The factor 1/2! in eq. ( |3.3| ) compensates for the fact that 
we are considering both orderings: x 2 <C xi <ti Q and x\ <C x 2 <C Q. Had we chosen to 
work with only one ordering, say X2 C ii < Q as stated at the beginning of this section, 
then the said-factor would have not been included. In terms of the coordinates (0, <fi, x) the 
jet mass fraction defined in eq. ( p.l| ) reduces to (in the small-angles limit): 

P= -ft(I-cos^) vxiOl (3.4) 
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where gluon i is in the jet. In eq. ( j3li| ) we used the step function to restrict the jet mass 
instead of Dirac-5 function because we are considering the integrated distribution instead 
of the differential one. Hence at two-gluon level, the correction term due to kt clustering 
(eq. ( |3.2| )) is given by: 

lus = i fc^V r^dx, r del del r #i@ M 2_ )x 
2 2! V 7T J J Xl x 2 J e\ e 2 2 J_ w 2tt y 1 P} 

x ®{ X2 e\ - P )Q(el - r 2 )q{r 2 - el)Q{e 2 2 - e 2 12 ), (3.5) 

where we used our freedom to set (fi 2 to 0. We write the result to single-log accuracy as: 

S^ = i(-^)V 2 L 2 , (3.6) 

where L = In (R 2 /p). In the above equation we ignored sub leading logs and used the 
fact that in the small-angles approximation: 0\ 2 = Of + 2 — 2Q\0 2 cos 4>\. The two-gluon 
coefficient T 2 is given by: 

T 2 = - J 3 # In 2 (2 cos <p) = ^ « 0.183. (3.7) 

As stated at the outset of this section, we can actually compute T% beyond the small- 
angles (thus small- R) limit. In appendix [A] we present an analytic calculation of this 
clustering coefficient as an expansion in the radius parameter. One observes that the 
small- R approximation of F 2 , given in eq. (|3.7|) , is actually valid for jet radii up to order 
unity because of its slow variation with R (first correction to the small- R result is of 0(R 4 )). 
For instance, at R = 0.7 and R = 1.0 the coefficient is J- 2 = 0.188 and 0.208 respectively, 
i.e. an increment of about 3% and 15%. We compare the analytical formula with the full 
numerical result in Fig. ||. 

3.2 Calculation in the C/A algorithm 

At the two-gluon level, order a 2 in the perturbative expansion of the shape distribution, the 
C/A and kt algorithms work essentially in a similar manner. Although the C/A algorithm 
clusters partons according only to the polar distances between the various pairs (recall 
p = in Eqs. Q2.2j ) and (|2.3|)), at this particular level energies do not seem to play a role 
(once they are assumed strongly-ordered). The jet shape is only altered if the softer gluon 
k 2 is (geometrically) closer to k\, or vice versa for the opposite ordering, than to the jet 
axis, so as to escape clustering with the jet. This similarity between the two algorithms 
does not hold to all orders though. We shall explicitly show, in the next section, that they 
start differing at O(a^). 

4. Three and four-gluon emission 

4.1 Three-gluon emission 
4.1.1 k t clustering case 

Consider the emission of three energy-ordered soft gluons Q S> k\ ^> k 2 ^> k%. We proceed 
in the same way as for the two-gluons case. First we write the step function ^3(^1, k 2 ,k^), 
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which describes the region of phase-space that gives rise to clustering logs. To this end, 
applying the kt clustering algorithm yields the following expression for H3: 

3 3 (A;i, k 2 , k 3 ) = Q(R 2 - 6 2 )Q(9 2 - R 2 )Q(R 2 - 61)9(61 - 6 2 23 ) + h o k 2 + 

+Q(R 2 - 9 2 )Q(6 2 - R 2 )Q(6 2 - R 2 )Q(9 2 - 9 2 23 )Q{9 2 - 6 2 3 ) + 

+e(R 2 - 9 2 )e(R 2 - 9 2 )Q{6 2 - R 2 )@(9 2 3 - 6 2 )Q{9 2 23 - 6 2 )Q{6 2 - 6 2 2 ). 

(4.1) 

Hence the correction term due to kt clustering at three-gluon level, X3 lus , which is of an 
analogous form to S2 lus (eq. ( |3.2[) ), is: 

^clus 



1 / C F a s \ 3 f 1 dxx dx 2 dx 3 f d9\ d9\ d9\ [* dfa d<\> 2 



J3 " 3! V vr ; J Xl x 2 x 3 J 9\ 9 2 9 2 J_ w 2vr 2vr 

x Q( Xl 9 2 - P )Q(x 2 6 2 - P )e(x 3 6 2 -p)x E 3 (9 1 ,9 2 , 9 3 , fa, 2 ), (4.2) 

where, as in S2 lus , we used our freedom to set 4> 3 = 0. Performing the integration, in the 
small-i? limit, the final result may be cast, to single-log accuracy, in the form: 



S 



clus 
3 



1 / Cpa s 



3! V vr 



3 x 2L 

'■ 2 iv- +^3 



T 2 L 2 + T 3 L Z 



(4.3) 



where the three-gluon coefficient T 3 = —0.052. Extending the formalism developed in 
appendix |A| to the case of three gluons, it is possible to write down an expansion of J- 3 in 
terms of R, just as we did with T 2 . However, since (a) the whole clustering logs correction 
to the anti-kt result is substantially small, as we shall see later in sec. [6|, and (b) IJ^I 
is much smaller than J- 2 , we do not perform such an analytical calculation here. We do 
perform a numerical evaluation of the full-iZ dependence of T 3 for various values of R, 
though. The final results are provided in table |l]. 

The first leading term in eq. ( |4.3| ) is the product of the one-gluon DL leading term 
in the anti-fc^ algorithm, a s L 2 , and the two-gluon SL term of eq. ( p.6[ ). The second NLL 
term in eq. ( [4.3[ ) is the new clustering log at O(a^). We expect that at n th order in a s 
new clustering logs of the form T n L n emerge. Notice that \J~ 3 \ < J- 2 , indicating that the 
series T n L n rapidly converges. Consequently, the two-gluon result is expected to be the 
dominant contribution. This expectation will be strengthened in the next section, where 
we compute the four-gluon coefficient J-4. Before doing so, we address the three-gluon 
calculation in the C/A algorithm. 

4.1.2 C/A algorithm case 

We follow the same procedure, outlined above for the kt algorithm, to extract the large 
logarithmic corrections to the anti-kt result in the C/A algorithm. As we stated before the 
algorithm deals with the angular separations of partons only. In this regard we consider 
the various possibilities of the angular configurations of gluons and apply the algorithm 
accordingly, to find a mis-cancellation of real-virtual energy-ordered soft emissions. The 
clustering condition step function in the C/A algorithm can be expressed as: 

^C/A _ „ fct 5 



J 3 — ^3 



(4.4) 
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where Hg* is given in eq. ( |4.1[) and the extra function reads: 

s 3 = e(jR 2 _ 0| )e( 02 _ fl 3)e ( ^ a _ el)Q{el - e 2 3 )e(e 2 - e 2 2 )e(e 2 23 - e 2 2 ). (4.5) 

Inserting the step function eq. (fO|) into the equivalent of eq. (|4.2| ) for the C /A algorithm 
one obtains: 

s clus,C/A = s clu S + 1 (-QpL\ 3 ^3 L\ (4.6) 

where SS 1 us is given in eq. (|4.3|) and J-3 = 0.0236. Hence the factor T 3 ^ which replaces the 

C/A ~ 

kt clustering term T% is, in the small- R limit, K , = 7" 3 + 7" 3 = -0.028. We provide the 
full-it! numerical estimates of J~ 3 ^ in table ED. This result illustrates that the contribution 
to the shape distribution at this order (a 3 ,) in the C/A clustering is approximately half 
that in the kt clustering (although the values are, in both algorithms, substantially small). 
Moreover, it confirms the conclusion reached-at in the kt algorithm case, for the C/A 
algorithm, namely that the clustering logs series are largely dominated by the two-gluon 
result. Next, we present the calculation of the four-gluon coefficient J- 4. 

4.2 Four-gluon emission 

Consider the emission of four energy-ordered soft primary gluons Q k\ S> k 3 S> k±. 
The determination of the clustering function H4 is more complex than previous lower orders, 
particularly for the C/A algorithm. As a result of this we only present, in this paper, the 
findings for the kt algorithm. The four-gluon calculations for the jet mass variable are 
very much analogous to those presented in ref. [22] for the energy flow distribution, to 
which the reader is referred for further details. Here we confine ourselves to reporting on 
the final answers. After performing the necessary phase-space integration, which is again 
partially carried out using Monte Carlo integration methods, and simplifying one arrives 
at the following expression for the correction term, to the anti-fet shape distribution, due 
to kt clustering: 

Sf s = - ("^) 4 x {e x ^ x T 2 L 2 + 4 x E x J-3L 3 + 3 x (F 2 L 2 ) 2 + F±A , (4.7) 

where, in the small-/? limit, T± = 0.0226. The full-i? numerical results are presented 
in table ||. As anticipated earlier we have T4 < IJ-3I <§C F2, thus confirming the rapid 
convergence behaviour of the clustering logs series. 

eq. ( [4.7|) contains products of terms in the expansion of the Sudakov anti-fcj form 
factor with the two- and three-gluon results, Eqs. (|3.6| ) and ( |4.3| ), as well as the new NLL 
clustering term at 0{cQ). The first leading term in eq. ( [4.7] ) comes from the product 
a 2 L 4 x two-gluon result (a^J^L 2 ); the second term comes from the product a s L 2 x three- 
gluon (a^J-3L 3 ) result; and the third term is the square of the two-gluon result. Therefore, 
the three- and four-gluon expressions, Eqs. ( fO] ) and ( |4.7| ), seem to suggest a pattern 
of "exponentiation". Such behaviours give rise to the intriguing possibility of finding a 
reasonably good approximation to the full resummation of clustering logs to all orders, the 
task to which we now turn. 
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5. All-orders result 



In analogy to the work of ref. [22], one can see that the results obtained at 2, 3 and 4-gluon 

(C /A) 

can readily be generalised to n-gluon level, with new terms of the form J 7 ^ L n appearing 
at each order for the kt (C/A) algorithm. By similar arguments to those of ref. [22] we can 
deduce the leading term in the n th order contribution due the kt (C/A) clustering to the 
shape distribution, vj^ lus ( c / A )_ re ads: 

Summing up the terms vj^ luh ( c / A ) to all orders, i.e. from n = 2 to n — > oo, yields the 
following resummed expression: 

S cius(c/A) K exp |_^£^|^ (9^1 A\ ( 5. 2) 

The first exponential in the above expression is the celebrated Sudakov form factor, that 
one obtains when resumming the jet mass distribution in the anti-/cj algorithm. Due to its 
Abelian nature, the Sudakov is entirely determined by the first primary emission result. 
There are also other pure Ti terms in vjcius(C/A) f or n > 4 Q f ^he form: 

^"* C/A) « {~T Vt {~ L )'- < 5 ' 3 > 

which can be resummed to all orders into: 

S cius(c/A) x exp (_^£^!l 3_ (_9e2Lt\\ (5.4) 



7T A 8 V 7T 



From Eqs. (5.2) and fl5.4|) , one anticipates the resummed result to all orders in the clustering 



log, L, to be of the form: 

Furthermore we have the following expression in £^ lus ^ C//A ^: 



f C F a s 






oc exp < 


t} 


exp | 







(5.5) 



which is resummed into: 

scmc/a)^ exp r_^^|^^/_^ L y (5J) 

Similarly, one expects analogous expressions to eq. (fTs]) for the remaining JF%, 
■ ■ ■ terms, in addition to "interference terms" between these coefficients, e.g. which 
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should first show up at C(a^L 5 ). Recall that the shape distribution in the kt (and C/A) al- 
gorithm is, in the Abelian primary emission part, the sum of the distribution in the anti-A^ 
algorithm (clustering-free distribution) and a clustering-induced distribution. Schemati- 
cally: 

yjfct(C/A) j-janti— kt _j_ j-iclus(clus,C/A) /g g\ 

where £ antl_kt is simply the Sudakov form factor mentioned above. Thus gathering every- 
thing together and including the logs which are present in the anti-fet case the following 
exponentiation is deduced: 



yjfc,(C/A) 



where J-^^ A ^ is the n th -gluon coefficient in the kt (C/A) algorithm. 

Confined to the anti-fct jet algorithm, the authors in [28] computed the full resummed 
jet mass distribution up to NLL accuracy, including the effect of the running coupling as 
well as hard collinear emissions 4 . Taking these and the fixed-order loop-constants into 
account, eq. ( |5.9| ) becomes: 

S fct(C/A) = ^ + E c ™"™) ex P i L 9i(»sL) + g 2 (a s L)] exp [g k 2 f /A) (a,L)] , (5.10) 

where a s = a s /2ir and the functions g\ and 52 resum the leading and next-to-leading logs 
occurring in the anti-fet case. Their explicit formulae are given in [28]. The new piece in the 
resummation which is due to primary-emission clustering and which contributes at NLL 
level is: 

9 k 2 f /A \a s L) = J2^_ 4 C/A) {-2C P tr , (5.H) 

n 

where we have introduced the evolution parameter t, which governs the effect of the running 
coupling: 

1 f Q dk t . 1 



t = — -^a s (k t ) = -—Hl-a s f3 L), (5.12) 

27r Jqvp/r k t 4lT Po 

where the last equality is the one-loop expansion oft, and we have /3o = (HCa — 2n/)/127r. 

In the present work, we have been able to compute, by means of brute force, the first 
three coefficients, ^=2,3,4, for the k% algorithm and only the first two coefficients, J-^^, 
for the C/A algorithm. They are, nonetheless, sufficient to capture the behaviour of the 
all-orders result, for a range of jet radii, as we shall show in the next section where we 
compare our findings to the output of a numerical Monte Carlo program. 

6. Comparison to MC results 

The MC program we use was first developed in [19] to resum non-global logs in the large- N c 
limit, and later modified to include the kt clustering in [21,22]. Since the MC program was 



'Leaving non-global contributions aside for now. 
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Figure 1: Comparisons of the analytical result to the output of the Monte Carlo program in the 
kt algorithm for two values of the jet radius. 



originally designed to resum soft wide-angle emissions to all orders, it only resums single 
logs. The leading logs in the jet mass distribution are, however, double logs. As such one 
cannot produce the corresponding DL Sudakov form factor with the MC. Hence it is not 
possible to directly compare the output of the MC with eq. ( |5.10| ), in order to verify the 
analytical calculations of 52, A- One can, however, extract the MC resummed clustering 



function, exp 



92,A 



, by subtracting off the result with clustering "switched off" from that 



with clustering "switched on". The remainder is then directly compared to exp [52, a] > where 



92,A is given in eq. (|5.1l[) . Such comparisons are presented in fig. [y. 

The plots display the MC estimate of exp (#2 j4.)> the analytical result of the latter in 
the cases where (a) only the first coefficient T% is included in the sum (|5TT| ) and (b) the 
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Figure 2: The output of the Monte Carlo program in the k t algorithm for various jet radii. 



first three coefficients, ^=2,3,4, are included. The dependence of the clustering coefficients 
T% on R, given in table |], is taken into consideration. 

One can clearly see that the function exp (52, a) is largely dominated by the first coeffi- 
cient J- 2, with minor corrections from T% and T±. For instance for R = 1.0 the J~2, J-3 and 
J-4 coefficients (put alone in turn) induce a correction to the anti-fcj resummed distribution 
of 1.6%, 0.06%, 0.002% and 4.8%, 0.3% and 0.02% for t = 0.15 and 0.25 respectively. This 



can be understood from eq. (5.11): in addition to being smaller than J-2, the higher-gluon 
coefficients T n (n > 3) are suppressed by a factorial factor (n!), thus leading to a fast 
convergence. Given the agreement between our analytical estimate and the output of the 
Monte Carlo, and given that the function exp(<?2,A) contributes at most 0(5%), we con- 
clude that our results for the resummed clustering logs are phenomenologically accurate 
for jet radii up to order unity, and that missing higher-order coefficients T n (n > 5) are 
unimportant. 

Lastly, we plot in fig. ^ the MC results of the function exp(g2,A) for various jet radii. 
The plots unequivocally indicate that for a fixed t, say 0.15, the function g2,A varies very 
slowly with R for R < 1 and grows relatively rapidly as R > 1. Such a behaviour may 
be explained by the analytical formula of J- 2, eq. ( A.lOj ), where the first correction to the 
small-i? result is proportional to R . 



6.1 Non-global logs 

As a final task, we plot in Fig 
eq. (I 



the full resummed jet mass distribution in the kt algorithm, 
4|), including the non-global factor S{t) in the large- N c limit, for various jet radii. In 
the figure, "Sudakov" refers to the Sudakov anti-fej form factor, vj antl - k t 5 "primary" refers 
to the primary form factor in the kt algorithm containing the clustering logarithms, 
(eq. ( |5.10D ), "Full: anti-fcj" refers to the full anti-fcj resummation, £ antl ~ kt ^anti-kt ^ an( j 
"Full: kt" refers to Y, kt S kt (t). We notice that the inclusion of non-global logs leads to a 
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noticeably large reduction of the full resummed result for the anti-fc^ algorithm case, while 
in the kt algorithm the reduction is moderate. 

As is well known by now [21-23,37], clustering reduces the impact of non- global logs 
through restricting the available phase-space for their contribution. This is also the case 
with the jet mass distribution where we note, for instance, that for t = 0.15 the impact 
of non-global logs in the anti-fc^ algorithm (for R = 0.1, 0.7 and 1.0) is a 24% reduction of 
the global part, while in thekt clustering case this is merely 7% (for R = 0.1 and R = 0.7) 
and 4% for R = 1.0. We further notice that for small values of the jet radius, S(t) is 
independent of R. While this is true for all values of R in the anti-fcf algorithm 5 , in the 
kt algorithm S falls down as R becomes larger. This is evident in the R = 1.0 plot in Fig. 

1 

7. Conclusions 

In this paper we have considered the possibility of exponentiation of clustering logs in the 
kt and C/A algorithms. We have found, by explicit calculations of the first few orders (up 
to 0(aj,), and including, for the first time in literature, the full jet radius dependence), 
that the perturbative expansion of the invariant jet mass distribution exhibits a pattern of 
an expansion of an exponential. Consequently we were able to write an all-orders partially 
resummed expression for primary emission clustering logs. We further checked our formula 
against the output of a numerical Monte Carlo and found a good agreement, within the 
accuracy of our calculations. We have therefore concluded that missing higher-order single- 
log terms in our resummed distribution have a negligible impact on the total resummed 
distribution for typical values of jet radii (up to order unity). 

Furthermore, we have briefly discussed the impact of the inclusion of non-global logs 
on the total resummed distribution. We confirmed previous observations concerning the 
facts that (a) non-global logs reduce the Sudakov peak of the distribution and that (b) 
such an impact is diminished when clustering is imposed on final-state particles. 

We note that the calculations we performed here can readily be generalised to a large 
class of non-global observables defined using the kt or C/A algorithm, where the observable 
is sensitive to soft emissions in a restricted region of phase space, e.g. angularities [35]. 
Since the calculations of the coefficients T n presented here are in fact independent of 
the jet shape and depend only on the angular configurations introduced by the clustering 
algorithm, then the effect of jet clustering can simply be included for any generic observable 
v (being sensitive to soft and collinear emissions inside the jet only) by introducing the 
exponential function exp[<72,A]> with exactly the same coefficients J- n we computed here. 
The only difference is the argument of the logarithm essentially becoming R 2 /v. 

Although we have confined ourselves to studying the single jet mass distribution in 
e + e~ annihilation, the extension of our work to, e.g. monojet production at the LHC 
(such as Z+ jet, which is an important channel in looking at BSM physics, and for which 
a calculation in the anti-^t algorithm has recently been performed [39]) can readily be 

5 <S antl ~ kt (t) for our jet mass is identical to that for the hemisphere jet mass case considered in [19]. 
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performed. In [38] we consider the extension of the work of ref. [39], which is carried out 
in the anti-&t jet algorithm, to the case where final-state jets are defined in the kf (and 
C/A) jet algorithms. We employ the techniques developed for e + e~ colliders in this paper 
to compute the full i?-dependent resummed clustering logs as well as non-global logs for 
hadron colliders. Although the calculations for the latter are much more involved, no major 
deviations from the overall picture drawn at the current paper are anticipated. 
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A. Full independence of clustering coefficients 

Here we present a calculation of the dependence of the coefficient of the clustering logs T 2 
away from the small-angles (thus small- R) approximation. To do so, it is easier to work 
with transverse momentum, (pseudo-)rapidity 6 and azimuthal angle with respect to the 
beam axis, (k t ,T],fa), variables instead of energy and polar angles, as performed in sec. ||[ 
We also specialise to the threshold limit in which the trigged jet is created at 90° to the 
beam (which is along the z-axis). Our calculations can straightforwardly be extended to 
the case where the triggered jet is at an arbitrary rapidity 7 . We parametrise the outgoing 
four-momenta as: 

Pl = 1(1,1,0,0), 
p 2 = |(1, -1,0,0), 

ki = h i (cosh rji, cos fa, sin fa, sinhr/,), (A.l) 

with i = 1, 2 for the two gluons respectively. In terms of the new variables, the clustering 
function (eq. (|3.1[ )) reads: 

E 2 (h, k 2 ) = 8(d lj - R 2 )e(R 2 - d 2j )G(d 2j - d l2 ), (A.2) 

with 

dij = vl + <pj, d 2j = V l + ^l d 12 = {r h - m f + {fa-faf. (A.3) 
In this coordinate system the jet mass becomes: 

2pi.ki Ak t i ( cos(/>j \ 

9 = ~m = V cos ^ " cos<w = 2Xi \ l " ^ ) ' (A - 4) 

when particle ki is recombined with the triggered p\ jet. In the above we expressed the jet 
mass in terms of the energy fraction Xi = 2uJi/Q, with Ui = ku coshr/j. 

6 Recall that r\ = — lntan(6l/2) and k t = ujs'm9. 

7 We perform such a calculation in [38] for each dipole. 
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The probability of a single virtual soft gluon correction is given by: 
, r d% 2 2{p 1 .p 2 ) C F a s dxi dfa 1 

dT i = -7, — 77T-^9s C F-, TT7 TT = dr l T2 5Tj ( A - 5 ) 



where we note here that the collinear limit to the triggered jet p\ corresponds to rji — > 
cmc? — )■ 0. 

Thus we can write the correction term due to clustering (eq. Q3.2|) ) as: 

£f = i / ^.e - 2(cos ,:^ os , i) p) W. (A.6) 

Performing the energy-fraction integration yields an expression identical to eq. ( |3.6D with 
the full .R-dependent coefficient, T 2 (R), given by: 

If 1 1 

^2{R) = -9 / dr}id^idr) 2 d^2 5- To 5—^2(^1,^2). (A.7) 

vr^ j cosh 771 — cos z <p\ cosh 7/2 — cos z 02 

This integration can be performed numerically to extract the value of T 2 for arbitrary 
R. However it proves useful, in the simple case of T 2 in order to obtain an analytic 
expression at least as a power-series, to introduce the polar variables r and a defined by: 

77 = r cos a, (p = rsina. (A. 8) 



We rewrite eq. (|A.7p as: 

roo fir rR 



1 pOO pTT pit p7V 

T 2 = —x I r\dr\ / da% / r 2 dr 2 / da 2 Q(2r 2 cos(a 2 - a\) - r%)x 

* JR J-n JO J -it 



1 1 \ 9 

cosh 2 (r\ cos ax) — cos 2 {r\ sin ax) cosh 2 (r 2 cos a 2 ) — cos 2 (r 2 sin a 2 ) 

We note that one can expand the second line of eq. ( |A.9| ) in powers of r,- L so as to 
write the result as a power-series in R. To first order, the second line expands as l/r 2 l/r 2 . 
We thus immediately identify this integral as the one for the small-angles approximation, 
whose result is 7r 2 /54 ~ 0.183. Performing higher-order integrals of the expansion of the 
integrand yields the following result: 

F 2 {R) = 0.183 + 0.0246i? 4 + 0.00183i? 8 + 0.000135i? 12 + 0(R 16 ). (A.10) 

This expansion is actually valid for values of R up to order unity, a claim which is backed-up 
by fully performing the integral ( |A.9| ) numerically via Monte Carlo methods and comparing 
to the analytical estimate ( [A.IOP , fig. ||. We note that the clustering coefficient varies very 
slowly with R. 

(C /A) 

Finally we note that the calculation of J3 and J-4 may be performed in the same 
way. Schematically we have: 

1 I n \ 

Fn(R) = — / TT*?i#i ro 7— E n (ki,k 2 ,--- ,k n ), (A.ll) 

vr" J -V- 1 - cosh rji — cos^ fa 

where the step function E n is expressed in terms of the distances di = rjf + (f)f and dij = 
Srjfj + d4>ij, respectively replacing 9i and Oij in, e.g. eq. (|4.lD . We provide full numerical 
estimates for all coefficients in table |l|. 
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R 





0.1 


0.4 


0.7 


1.0 


1.2 




0.183 


0.184 


0.184 


0.188 


0.208 


0.242 


^3 


-0.052 


-0.053 


-0.053 


-0.055 


-0.061 


-0.072 


^C/A 
•^3 


-0.028 


-0.029 


-0.029 


-0.029 


-0.030 


-0.031 


^4 


0.022 


0.023 


0.023 


0.023 


0.024 


0.027 



Table 1: Estimates of the clustering coefficients T n for n = 2,3,4 at different values of the jet 
radius R in the k t and C/A algorithms. Recall that ^ A = T^(^ T-i) and that has not 

been computed. Note that J~^ A and T± are more difficult to evaluate numerically than J-3 (and 
obviously T-i). Thus their final answers are subjected to larger errors. 
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Figure 3: Comparisons of the Sudakov result, the correct primary result and the full result in- 
cluding non-global logarithms with and without clustering, as detailed in the main text. 
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Figure 4: Full i?-dcpcndcnce of the two-gluon clustering coefficient Ti. 
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